NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

ARDuP: Active Region Video Diffusion for Universal Policies

https://doi.org/10.1109/IROS58592.2024.10802264

Huang, Shuaiyi; Levy, Mara; Jiang, Zhenyu; Anandkumar, Anima; Zhu, Yuke; Fan, Linxi; Huang, De-An; Shrivastava, Abhinav (October 2024, IEEE)

Full Text Available
LEAP: Liberate Sparse-view 3D Modeling from Camera Poses

Jiang, Hanwen; Jiang, Zhenyu; Zhao, Yue; Huang, Qixing (May 2024, International Conference on Learning Representations)

Full Text Available
Learning Generalizable Manipulation Policies with Object-Centric 3D Representations

Zhu, Yifeng; Jiang, Zhenyu; Stone, Peter; Zhu, Yuke (November 2023, Conference on Robot Learning)

We introduce GROOT, an imitation learning method for learning robust policies with object-centric and 3D priors. GROOT builds policies that generalize beyond their initial training conditions for vision-based manipulation. It constructs object-centric 3D representations that are robust toward background changes and camera views and reason over these representations using a transformer-based policy. Furthermore, we introduce a segmentation correspondence model that allows policies to generalize to new objects at test time. Through comprehensive experiments, we validate the robustness of GROOT policies against perceptual variations in simulated and real-world environments. GROOT's performance excels in generalization over background changes, camera viewpoint shifts, and the presence of new object instances, whereas both state-of-the-art end-to-end learning methods and object proposal-based approaches fall short. We also extensively evaluate GROOT policies on real robots, where we demonstrate the efficacy under very wild changes in setup.
more » « less
Full Text Available
Ditto in the House: Building Articulation Models of Indoor Scenes through Interactive Perception

https://doi.org/10.1109/ICRA48891.2023.10161431

Hsu, Cheng-Chun; Jiang, Zhenyu; Zhu, Yuke (May 2023, 2023 IEEE International Conference on Robotics and Automation (ICRA))

Full Text Available
Ditto: Building Digital Twins of Articulated Objects from Interaction

https://doi.org/10.1109/CVPR52688.2022.00553

Jiang, Zhenyu; Hsu, Cheng-Chun; Zhu, Yuke (June 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))

Full Text Available
ACID: Action-Conditional Implicit Visual Dynamics for Deformable Object Manipulation

Shen, Bokui; Jiang, Zhenyu; Choy, Christopher; Savarese, Silvio; Guibas, Leonidas J.; Anandkumar, Anima; Zhu, Yuke (June 2022, Robotics: Science and Systems XVIII)

Full Text Available

Search for: All records